Towards automatic configuration of big data processing systems

نویسندگان

  • Muhammad Bilal
  • Marco Canini
چکیده

The goal of this research is to develop a generic framework for automatic configuration optimization of big data processing systems. The framework will allow users to specify their performance goals as well as their own metrics. We plan on utilizing system developer provided hints for searching suitable configurations for big data application deployments. Introduction Big data processing systems have dozens of available configuration parameters and system performance crucially depends on tuning several of these parameters. Moreover, the suitable configurations vary depending upon the available resources, workloads and applications for which the system is being used. Today, optimization of these configurations is done manually, possibly requiring several hours of investigation and testing by performance engineers. Performance engineers also need to have detailed knowledge about the big data system under consideration. This makes configuration optimization a tedious and time consuming task. Most of existing works present solutions specific to MapReduce jobs [2,3]. However, to the best of our knowledge the broader research challenge remains open. Preliminary Analysis To understand the variation in performance due to system configurations, we conducted preliminary tests on a multi-node Apache Storm cluster. The setup includes 3 nodes, with the capability to run 96 threads simultaneously. We use the Rolling word count topology of the Intel Storm benchmark [1], with count window size set to 10s and count emit frequency set to 1s. Configuration Possible values Number of workers and acker threads 3-12 (each) Spout, Split Bolt and Count bolt threads 3-64 (each) Table 1 Selected configurations and their values. Following an Experimental Design approach, we use a space filling algorithm to cover the configuration space. Table 1 shows the space of possible configurations considered for each experiment. We have fixed the total number of threads in the topology to 96. We vary the number of threads assigned to each of the topology component while keeping the total number of threads constant. We used the WSP space filling algorithm [4] to generate 77 configurations for our experiments. Figure 1 shows the variation of different performance metrics across different configurations. The per-tuple latency varies between 3ms to 108ms at the median and from 3ms to 577ms at the 99th percentile. In addition, there is a substantial variation in throughput from 10k tuples/s to 117k tuples/s. Thus we observed a variation as high as 2 orders of magnitude in latency and 1 order of magnitude in throughput, given a fixed resource budget. The interplay between parallelism of different Storm components has lead to this drastic variance in performance. The amount of work done by each component of the topology and its processing latency, the number of executor for each bolt/spout and their placement, all play a part in the resulting overall performance. Figure 1: Performance results for selected configurations.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Sensitometric characteristics of D-, E- and F-speed dental radiographic films in manual and automatic processing

BACKGROUND: The purpose of this study was to evaluate the sensitometric characteristics of Ultraspeed, Ektaspeed Plus and Insight dental radiographic films using manual and automatic processing systems. METHODS: In this experimental invitro study, an aluminum step-wedge was used to construct characteristic curves for D-, E- and F-speed radiographic films (Kodak Eastman, Rochester, USA). All fil...

متن کامل

Design and Test of the Real-time Text mining dashboard for Twitter

One of today's major research trends in the field of information systems is the discovery of implicit knowledge hidden in dataset that is currently being produced at high speed, large volumes and with a wide variety of formats. Data with such features is called big data. Extracting, processing, and visualizing the huge amount of data, today has become one of the concerns of data science scholar...

متن کامل

Towards Optimal Execution of Density-based Clustering on Heterogeneous Hardware

Data Clustering is an important and highly utilized data mining technique in various application domains. With ever increasing data volumes in the era of big data, the efficient execution of clustering algorithms is a fundamental prerequisite to gain understanding and acquire novel, previously unknown knowledge from data. To establish an efficient execution, the clustering algorithms have to be...

متن کامل

Natural Language Processing Methods Used for Automatic Prediction Mechanism of Related Phenomenon

The paper presents an idea to combine variety of Natural Language Processing techniques with different classification methods as a tool for automatic prediction mechanism of related phenomenon. Different types of preprocessing techniques are used and verified, in order to find the best set of them. It is assumed that such approach allows to recognize the phenomenon which is related to the text....

متن کامل

On Configuring Distributed Memory Process Grids for Tensor Contraction Applications

Tensor contractions are important computations for many domains including quantum chemistry and big-data applications. Distributed-memory architectures, architectures where numerous processing elements have private memory, are used in order to solve large problems efficiently. On such systems, achieving high performance computation is dependent on the arrangement of the processing elements. The...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2017